Marker Genotype Imputation in a Low- Marker-Density Panel with a High-Marker- Density Reference Panel: Accuracy Evaluation in Barley Breeding Lines

نویسندگان

  • Hiroyoshi Iwata
  • Jean-Luc Jannink
چکیده

We evaluated a strategy in which the scores of markers untyped in a low-density experimental panel were imputed on the basis of data from a high-density reference panel, in its application to whole-genome genotyping of barley (Hordeum vulgare L.) breeding lines. Using a barley core set consisting of 98 lines genotyped with 3205 markers (high-density reference panel), we imputed marker scores untyped in 863 barley breeding lines genotyped with 1330 common markers (low-density experimental panel). In repeated analyses, the scores of one common marker were masked in the experimental panel, and then imputed as an untyped marker. Imputation accuracy was evaluated by comparing imputed scores with true ones. The correct imputation rate was >0.9 in 92% of markers. The square of correlation coeffi cient between true genotypes and mean imputed genotypes was >0.6 in 90% of the markers. Factors affecting imputation accuracy were minor allele frequency, linkage disequilibrium with neighbor common markers, minimum distance to the closest common marker, and degree of differentiation among subpopulations. Actual quantitative trait loci (QTL) would be unobserved in both reference and experimental panels. Markers masked in both panels to mimic this situation sometimes showed larger correlation to imputed markers than to typed common markers, indicating that imputation can sometimes capture the variation of unknown QTL better than the genotypes of common markers. H. Iwata, National Agriculture and Food Research Organization, National Agricultural Research Center, Tsukuba, Ibaraki 305-8666, Japan; J.-L. Jannink, USDA-ARS, R.W. Holley Center for Agriculture and Health, Cornell Univ., Ithaca, NY 14853. H. Iwata, current address: Univ. of Tokyo, Graduate School of Agricultural and Life Sciences, Bunkyo, Tokyo 113-8657, Japan. Received 9 Aug. 2009. *Corresponding author ([email protected]). Abbreviations: BOPA, barley oligonucleotide pool assay; CAP, Coordinated Agricultural Project; DArT, diversity arrays technology; LD, linkage disequilibrium; MAF, minor allele frequency; POPA, pilot oligonucleotide pool assay; QTL, quantitative trait loci; SNP, single nucleotide polymorphism. Published in Crop Sci. 50:1269–1278 (2010). doi: 10.2135/cropsci2009.08.0434 Published online 20 Apr. 2010. © Crop Science Society of America | 5585 Guilford Rd., Madison, WI 53711 USA All rights reserved. No part of this periodical may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopying, recording, or any information storage and retrieval system, without permission in writing from the publisher. Permission for printing and for reprinting the material contained herein has been obtained by the publisher.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Estimation of genotype imputation accuracy using reference populations with varying degrees of relationship and marker density panel

Genotype imputation from low-density to high-density (SNP) chips is an important step before applying genomic selection, because denser chips can provide more reliable genomic predictions. In the current research, the accuracy of genotype imputation from low and moderate-density panels (5K and 50K) to high-density panels in the purebred and crossbred populations was assessed. The simulated popu...

متن کامل

Imputation of parent-offspring trios and their effect on accuracy of genomic prediction using Bayesian method

The objective of this study was to evaluate the imputation accuracy of parent-offspring trios under different scenarios. By using simulated datasets, the performance Bayesian LASSO in genomic prediction was also examined. The genome consisted of 5 chromosomes and each chromosome was set as 1 Morgan length. The number of SNPs per chromosome was 10000. One hundred QTLs were randomly distributed a...

متن کامل

Comparing Different Marker Densities and Various Reference Populations Using Pedigree-Marker Best Linear Unbiased Prediction (BLUP) Model

In order to have successful application of genomic selection, reference population and marker density should be chosen properly. This study purpose was to investigate the accuracy of genomic estimated breeding values in terms of low (5K), intermediate (50K) and high (777K) densities in the simulated populations, when different scenarios were applied about the reference populations selecting. Af...

متن کامل

Imputation of genotypes from low- to high-density genotyping platforms and implications for genomic selection.

The objective of this study was to quantify the accuracy achievable from imputing genotypes from a commercially available low-density marker panel (2730 single nucleotide polymorphisms (SNPs) following edits) to a commercially available higher density marker panel (51 602 SNPs following edits) in Holstein-Friesian cattle using Beagle, a freely available software package. A population of 764 Hol...

متن کامل

Imputation of missing single nucleotide polymorphism genotypes using a multivariate mixed model framework.

The objective of this paper was to investigate, for various scenarios at low and high marker density, the accuracy of imputing genotypes when using a multivariate mixed model framework using information from 2, 4, or 10 surrounding markers. This model predicts genotypes at a locus, using genotypes at nearby loci as correlated traits, and the additive genetic relationship matrix to use informati...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010